Varying the number of principal components for modeling sample structure
نویسندگان
چکیده
We examined the sensitivity of Coal-Map to the number of covariates used to model global and local sample structures. First, we represented the local sample structure using the top three covariates obtained after applying principal components analysis on the local partition Xl containing the test locus xj. Global sample structure was represented using the top two covariates after performing principal components analysis on the full alignment X excluding the local partition X`. This resulted in two models: one using the covariates W global j = (w1, w2) and the other using the covariates W glocal j = (w1, w2 . . . w5), respectively. We selected one of the aforementioned two models for each test locus xj using the heuristic approach described in the Methods section. In Figure S1, the performance of Coal-Map, using five covariates to represent sample structure (two for global and three for local), and EIGENSTRAT is shown using receiver operating characteristic (ROC) curves. Using Delong et al. test [1] with Benjamini-
منابع مشابه
Modeling and Forecasting Iranian Inflation with Time Varying BVAR Models
This paper investigates the forecasting performance of different time-varying BVAR models for Iranian inflation. Forecast accuracy of a BVAR model with Litterman’s prior compared with a time-varying BVAR model (a version introduced by Doan et al., 1984); and a modified time-varying BVAR model, where the autoregressive coefficients are held constant and only the deterministic components are allo...
متن کاملOn convergence of sample and population Hilbertian functional principal components
In this article we consider the sequences of sample and population covariance operators for a sequence of arrays of Hilbertian random elements. Then under the assumptions that sequences of the covariance operators norm are uniformly bounded and the sequences of the principal component scores are uniformly sumable, we prove that the convergence of the sequences of covariance operators would impl...
متن کاملبررسی ساختار جمعیتی گاوهای بومی ایران با استفاده از تحلیل افتراقی مؤلفههای اصلی
Effective management of genetic resources in the domestic animals is based on characterization of genetic structure and diversity among populations. Strategies reducing complexity and dimensions of data are required to analyze the genetic relationships between populations based on dense genomic data. The objective of this study was to use the discriminant analysis of principal components (DAPC)...
متن کاملFunctional Analysis of Iranian Temperature and Precipitation by Using Functional Principal Components Analysis
Extended Abstract. When data are in the form of continuous functions, they may challenge classical methods of data analysis based on arguments in finite dimensional spaces, and therefore need theoretical justification. Infinite dimensionality of spaces that data belong to, leads to major statistical methodologies and new insights for analyzing them, which is called functional data analysis (FDA...
متن کاملPatterns Prediction of Chemotherapy Sensitivity in Cancer Cell lines Using FTIR Spectrum, Neural Network and Principal Components Analysis
Drug resistance enables cancer cells to break away from cytotoxic effect of anticancer drugs. Identification of resistant phenotype is very important because it can lead to effective treatment plan. There is an interest in developing classifying models of resistance phenotype based on the multivariate data. We have investigated a vibrational spectroscopic approach in order to characterize a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015